Prosody Modification for Vocoder Based on Amplitude Spectrum of Residual Signal

نویسندگان

  • Zhengqi Wen
  • Jianhua Tao
چکیده

This paper describes the prosody modification (pitch and duration) for vocoder based on amplitude spectrum of residual signal. In this vocoder, period component is represented as amplitude spectrum of half pitch period length and aperiod component is estimated from the difference of amplitude spectrum between the constructed period signal and the residual signal. Then, pitch modification is conducted by resampling the period spectrum according to desired pitch period length in frequency domain and duration modification is conducted by adjusting the frame shift length in time domain. Listening tests show that the speech quality of proposed vocoder after modification is not decreased so much and can get comparable performance with STRAIGHT.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A new F0 modification algorithm by manipulating harmonics of magnitude spectrum

This paper proposes a new speech modification algorithm based on a vocoder framework to synthesize high quality speech. Its innovation is in preserving the fine structure of the magnitude spectrum. A key point is the use of a “compensatory gaussian window” to extract moderate F0 harmonics structures in the magnitude spectrum. The other key point is, starting from the magnitude spectrum, generat...

متن کامل

Amplitude Spectrum based Excitation Model for HMM-based Speech Synthesis

This paper describes an excitation model based on amplitude spectrum for hidden Markov model (HMM)-based speech synthesis system (HTS). Residual signal obtained from inverse filtering is decomposed into periodic and aperiodic spectrums in frequency domain. Amplitude spectrum of half pitch period length is reserved as periodic component in synthesis stage and zero-phase criterion and pitch synch...

متن کامل

Prosody Modification Using Allpass Residual of Speech Signals

In this paper, we attempt to signify the role of phase spectrum of speech signals in acquiring an accurate estimate of excitation source for prosody modification. The phase spectrum is parametrically modeled as the response of an allpass (AP) filter, and the filter coefficients are estimated by considering the linear prediction (LP) residual as the output of the AP filter. The resultant residua...

متن کامل

A new synthesis algorithm using phase information for TTS systems

New speech synthesis algorithms capable of flexible prosody (es pecially F0) modification are desired for a high quality TTS syst em. TD-PSOLA is the most popular synthesis algorithm. The al gorithm shows very high quality when F0 modification is limite d. However, the quality degradation due to pitch epoch detection error becomes severe as the F0 modification factor becomes lar ge. On the othe...

متن کامل

Designing Japanese Speech Database Cov for Hybrid Speech Sy

For the purpose of building Text-to-Speech (TTS) system that can generate high-quality and wide range speech in prosody, we conducted speech database construction. As a speech synthesizer, we use a hybrid system which consists of a unit selection module and prosody modification by STRAIGHT (vocoder type high quality analysis-synthesis method). Our viewpoint is to reduce an amount of prosody mod...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011